(54) Long-chain prenyl diphosphate synthase

(57) The present invention discloses a mutated enzyme comprising a geranylgeranyl diphosphate synthase having its origin in wild type Sulfolobus acidocaldarius, wherein at least one of phenylalanine at position 77, methionine at position 85, valine at position 99, tyrosine at position 101, phenylalanine at position 118, arginine at position 199 and aspartic acid at position 312 is substituted with another amino acid.
Description

BACKGROUND OF INVENTION 
1. Field of Invention

2. Related Art



Prenyl diphosphates are precursors of biologically important compounds such as vitamin K, ubiquinone (CoQ) and vitamin E. According to chain length they are classified into dimethylallyl diphosphate (DMAPP; C5), geranyl diphosphate (GPP; C10), farnesyl diphosphate (FPP; C15), geranylgeranyl diphosphate (GGPP; C20), geranylfarnesyl diphosphate (GFPP; C25), hexaprenyl diphosphate (HPP; C30), heptaprenyl diphosphate (HepPP; C35) and octaprenyl diphosphate (OPP; C40).

The prenyl diphosphate synthases that produce them exist in forms including farnesyl diphosphate synthase (FPS), geranylgeranyl diphosphate synthase (GGPS), geranylfarnesyl diphosphate synthase (GFPS), hexaprenyl diphosphate synthase (HexPS), heptaprenyl diphosphate synthase (HepPS) and octaprenyl diphosphate synthase (OPS). The carbon chain length and stereoisomerism of the synthesized prenyl diphosphate are specific to each enzyme.

Known heat-stable prenyl diphosphate synthases include heptaprenyl diphosphate synthase (HepPS) (Koike-Takeshita, A. et al. (1995) J. Biol. Chem. 270, 18396-18400) from the moderately thermophilic bacterium Bacillus stearothermophilus; geranylgeranyl diphosphate synthase from the hyperthermophilic archaebacterium Sulfolobus acidocaldarius (J. Biol. Chem. 269, 14792-14797); as well as farnesyl diphosphate/geranylgeranyl diphosphate synthase ((1993) J. Biol. Chem. 268, 11002-11007). Of these, only HepPS can synthesize prenyl diphosphate having at least 25 carbon atoms, and no other highly stable enzyme that synthesizes such prenyl diphosphate has been reported. In addition, the above-mentioned HepPS does not have adequate heat resistance, is composed of two types of subunits, and is not always easy to handle.
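The chain-length nomenclature used in this specification follows from the fact that each condensation with isopentenyl diphosphate (IPP) adds one C5 isoprene unit. The following is a minimal sketch of that arithmetic. The function and dictionary names are illustrative assumptions, and the abbreviations GPP (C10) and FPP (C15) are taken from standard usage because the corresponding text is damaged in this copy.

```python
# Illustrative sketch only: names are hypothetical, not part of the specification.
# Abbreviations keyed by carbon number (one isoprene unit = 5 carbons).
NAME_BY_CARBONS = {
    5: "DMAPP", 10: "GPP", 15: "FPP", 20: "GGPP",
    25: "GFPP", 30: "HPP", 35: "HepPP", 40: "OPP",
}
CARBONS_BY_NAME = {name: c for c, name in NAME_BY_CARBONS.items()}


def carbons(isoprene_units: int) -> int:
    """Carbon count of a prenyl chain built from the given number of C5 units."""
    if isoprene_units < 1:
        raise ValueError("a prenyl chain has at least one isoprene unit")
    return 5 * isoprene_units


def elongate(allylic_substrate: str, n_ipp: int) -> str:
    """Name of the product after n_ipp condensations of IPP onto the allylic substrate."""
    if n_ipp < 0:
        raise ValueError("number of condensations cannot be negative")
    total = CARBONS_BY_NAME[allylic_substrate] + 5 * n_ipp
    # Fall back to a generic label for chain lengths without a listed abbreviation.
    return NAME_BY_CARBONS.get(total, f"C{total} prenyl diphosphate")
```

For instance, `elongate("GGPP", 1)` names the C25 product that distinguishes the mutant enzymes from the wild type enzyme.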



SUMMARY OF INVENTION 



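The present invention provides a mutant enzyme wherein at least one of the phenylalanine residue at position 77, methionine residue at position 85, valine residue at position 99, tyrosine residue at position 101, phenylalanine residue at position 118, arginine residue at position 199 and aspartic acid residue at position 312 of geranylgeranyl diphosphate synthase of Sulfolobus acidocaldarius origin is substituted with another amino acid, and which enzyme can synthesize prenyl diphosphate having at least 25 carbon atoms.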
Moreover, the present invention provides a gene system that codes for the above-mentioned enzyme, and a process for producing the above-mentioned enzyme using that gene system.

Furthermore, the present invention provides a process for producing a mutant prenyl diphosphate synthase comprising the steps of culturing a host transformed with a gene in which the codon for the phenylalanine residue located at the fifth position on the N-terminal side from the N-terminal amino acid of the aspartate-rich domain I, in a gene that codes for the native enzyme, is converted to a codon for a non-aromatic amino acid, thereby enabling the expression of a mutant enzyme that is able to synthesize prenyl diphosphates having a longer chain than the longest chain of prenyl diphosphate synthesized by the native prenyl diphosphate synthase.
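As a rough illustration of converting a phenylalanine codon to a codon for a non-aromatic amino acid, the sketch below lists the amino acids reachable from a phenylalanine codon by a single base change, using the standard genetic code. It is an assumption-laden example rather than the procedure of the specification: the helper names are hypothetical, "aromatic" is taken here to mean Phe, Tyr and Trp, and stop codons are excluded.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

AROMATIC = {"F", "Y", "W"}  # assumption: His is not counted as aromatic here


def single_base_variants(codon: str):
    """Yield every codon that differs from `codon` at exactly one position."""
    for i, original in enumerate(codon):
        for base in BASES:
            if base != original:
                yield codon[:i] + base + codon[i + 1:]


def non_aromatic_neighbours(codon: str) -> dict:
    """Map each single-base variant of `codon` to its non-aromatic, non-stop amino acid."""
    return {
        variant: CODON_TABLE[variant]
        for variant in single_base_variants(codon)
        if CODON_TABLE[variant] not in AROMATIC and CODON_TABLE[variant] != "*"
    }
```

Under these assumptions, both phenylalanine codons (TTT and TTC) can reach serine and leucine by one base change, which matches the substitutions reported for position 77.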

In addition, the present invention provides a process for producing long-chain prenyl diphosphate using the above-mentioned enzyme.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 indicates the mutation sites of the present invention in the geranylgeranyl diphosphate synthase derived from Sulfolobus acidocaldarius. The arrows in the drawing indicate two aspartate-rich domains.

Fig. 2 is a photograph that indicates the autoradiograph of a thin layer chromatography which shows the products in the case of allowing the mutant enzymes of the present invention produced in yeast to act on substrates IPP and (all-E)-FPP. The ellipses show the positions of cold authentic samples, which are geraniol, farnesol and geranylgeraniol for a, b and c respectively.

Fig. 3 is a photograph that indicates the autoradiograph of a thin layer chromatography which shows the products in the case of allowing the mutant enzyme of the present invention produced in yeast to act on substrates IPP and (all-E)-GGPP. The ellipses show the positions of cold authentic samples, which are geraniol, farnesol and geranylgeraniol for a, b and c respectively.

Fig. 4 is a photograph that indicates the autoradiograph of a thin layer chromatography which shows the products in the case of allowing the mutant enzyme of the present invention produced in E. coli to act on (A) substrates IPP and DMAPP, and on (B) substrates IPP and GPP. The ellipses show the positions of cold authentic samples, which are geraniol, farnesol and geranylgeraniol for a, b and c respectively.

Fig. 5 is a photograph that indicates the autoradiograph of a thin layer chromatography which shows the products in the case of allowing the mutant enzyme of the present invention produced in E. coli to act on (A) substrates IPP and (all-E)-FPP, and on (B) substrates IPP and (all-E)-GGPP. The ellipses show the positions of cold authentic samples, which are geraniol, farnesol and geranylgeraniol for a, b and c respectively.

DETAILED DESCRIPTION 

As a specific example in the present invention, a geranylgeranyl diphosphate synthase (GGPS) gene of the hyperthermophilic archaebacterium, Sulfolobus acidocaldarius, is used for the starting material. The cloning method of this gene is described in detail in the specification of Japanese Patent Application No. 6-315572. In addition, another example for cloning the gene is described in the present specification as Example 1, and a nucleotide sequence and an amino acid sequence encoded thereby are shown as SEQ ID NO: 1.
In the present invention, a cloned DNA is mutated in vitro. Although chemical treatment using a mutagen, or physical treatment using UV light or X-rays, can be used for the mutation means, chemical treatment is convenient to carry out. Any routinely used chemical mutagen can be used for the mutagenesis for the present invention, an example of which is nitrite.

A specific example of mutagenesis is shown in Example 2.

The mutagenized DNA is inserted into a yeast expression vector to prepare a DNA library. Any vector that is able to express an inserted extraneous gene in yeast can be used as an expression vector, examples of which include yeast plasmids such as pYEUra3 (available from Clontech) and pYES2 (available from Invitrogen).

The resulting plasmid library is introduced into a yeast mutant strain defective for the ability to synthesize hexaprenyl diphosphate, which is one of the precursors of coenzyme Q6. Since this mutant strain is unable to synthesize the coenzyme Q6 necessary for non-fermentative sugar metabolism, it cannot be grown in medium that contains glycerol as the sole carbon source. Thus, if the yeast transformed by the above-mentioned library is cultured in glycerol medium and the strains that grow are selected, strains can be selected that have acquired the ability to synthesize prenyl diphosphate having a large number of carbon atoms for coenzyme Q synthesis.

Five positive clones were obtained in this manner from approximately 1400 transformants. As a result of purifying the plasmids from these clones, determining the nucleotide sequences of the inserted fragments, and predicting the encoded amino acid sequences, each mutant was found to have changes in the amino acid sequence as indicated below.

Mutant 1: Methionine at position 85 changed to isoleucine, arginine at position 199 changed to lysine, and aspartic acid at position 312 changed to asparagine
Mutant 2: Phenylalanine at position 118 changed to leucine
Mutant 3: Phenylalanine at position 77 changed to serine
Mutant 4: Phenylalanine at position 77 changed to leucine and valine at position 99 changed to methionine
Mutant 5: Phenylalanine at position 77 changed to serine and tyrosine at position 101 changed to histidine
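The five sets of substitutions can be written in the usual one-letter notation (for example F77S) and applied to a sequence programmatically. The sketch below is illustrative only: the function name is hypothetical, and because the wild type sequence (SEQ ID NO: 1) is not reproduced in this section, a placeholder sequence that merely carries the correct wild type residues at the seven positions is used instead.

```python
import re

# Substitutions of the five mutants in one-letter notation (wild type, position, new residue).
MUTANTS = {
    "Mutant 1": ["M85I", "R199K", "D312N"],
    "Mutant 2": ["F118L"],
    "Mutant 3": ["F77S"],
    "Mutant 4": ["F77L", "V99M"],
    "Mutant 5": ["F77S", "Y101H"],
}


def apply_substitutions(sequence: str, substitutions: list) -> str:
    """Return a copy of `sequence` with each substitution applied (1-based positions)."""
    residues = list(sequence)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        wild_type, position, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(f"expected {wild_type} at position {position}")
        residues[position - 1] = new
    return "".join(residues)


def placeholder_wild_type(length: int = 330) -> str:
    """Placeholder sequence (NOT SEQ ID NO: 1) with the stated residues at the seven positions."""
    residues = ["A"] * length
    for position, aa in {77: "F", 85: "M", 99: "V", 101: "Y", 118: "F", 199: "R", 312: "D"}.items():
        residues[position - 1] = aa
    return "".join(residues)
```

Checking the wild type residue before replacing it guards against off-by-one numbering errors when a real sequence is substituted for the placeholder.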

In contrast to the wild type enzyme being unable to synthesize prenyl diphosphate having at least 25 carbon atoms, enzymes having amino acid sequences containing these changes were determined to be able to synthesize prenyl diphosphate having 25 or more carbon atoms. Those amino acid sequences having the above-mentioned amino acid substitutions are shown in SEQ ID NOs: 2 to 6.

Thus, it can be logically surmised that if an amino acid at any one of the above-mentioned positions is replaced with another amino acid, a prenyl diphosphate having more carbon atoms than that synthesized by the wild type enzyme can be synthesized. Thus, the present invention provides a mutant enzyme wherein at least one of phenylalanine at position 77, methionine at position 85, valine at position 99, tyrosine at position 101, phenylalanine at position 118, arginine at position 199 and aspartic acid at position 312 is replaced with another amino acid, and which enzyme is able to synthesize prenyl diphosphate having at least 25 carbon atoms.

Thus, in one embodiment of the present invention, at least phenylalanine at position 77 is replaced with another amino acid such as serine. This enzyme includes enzymes in which, in addition to position 77, an amino acid at another of the above-mentioned positions is also replaced. Thus, the present invention includes enzymes in which only phenylalanine at position 77 is replaced, enzymes in which phenylalanine at position 77 and valine at position 99 are replaced, enzymes in which phenylalanine at position 77 and tyrosine at position 101 are replaced, and enzymes in which phenylalanine at position 77 and one or a plurality of amino acids at the above-mentioned positions are replaced.

In another embodiment, the present invention includes enzymes whose amino acid sequences differ by a small number of amino acids, for example up to 10, with respect to the amino acid sequences shown in SEQ ID NOs: 2 to 6, and which are able to synthesize prenyl diphosphate having at least 25 carbon atoms.

Phenylalanine at position 77 is located at the fifth position upstream, on the N-terminal side, from the N-terminal amino acid of aspartate-rich domain I. The present invention therefore also relates to a mutant prenyl diphosphate synthase in which the amino acid at this position is replaced, and to its production by changing the codon for that amino acid.

In addition, the present invention provides a gene coding for the various above-mentioned mutant enzymes, a vector comprising that gene, particularly an expression vector, and a host transformed with said vector. The gene (DNA) of the present invention can be easily obtained by introducing a mutation into DNA that codes for the wild type amino acid sequence indicated in SEQ ID NO: 1, according to routine methods such as site-directed mutagenesis or PCR.

Moreover, once the amino acid sequence of the target enzyme has been determined, a suitable nucleotide sequence that codes for it can be determined, thus making it possible to chemically synthesize the DNA by conventional DNA synthesis methods.

In addition, the present invention provides an expression vector comprising the DNA as described above, hosts transformed with said expression vector, and a process for producing an enzyme or peptide of the present invention using these hosts.

Although expression vectors contain an origin of replication, expression control sequence and so forth, these vary according to the host. Examples of hosts include procaryotes, examples of which include bacteria such as E. coli and Bacillus sp. including Bacillus subtilis; eucaryotes, examples of which include yeasts such as Saccharomyces sp. including S. cerevisiae, and Pichia sp. including Pichia pastoris; molds, examples of which include Aspergillus sp. such as A. oryzae and A. niger; and animal cells, examples of which include cultured cells of higher animals, such as CHO cells. In addition, it is also possible to use plants for the host.

According to the present invention, as indicated in the Examples, geranylfarnesyl diphosphate can be accumulated in the culture by culturing a host transformed by the DNA of the present invention, and geranylfarnesyl diphosphate can be produced by recovering it from the culture. Also according to the present invention, geranylfarnesyl diphosphate can be produced by allowing the mutant GGPP synthase produced according to the process of the present invention to act on the isopentenyl diphosphate substrate and an allylic substrate such as farnesyl diphosphate.

In an example of using E. coli for the host, regulation of gene expression is known to occur at stages such as the transcription of mRNA from DNA and the translation of protein from mRNA. In addition to those sequences present in nature (e.g. lac, trp, bla, lpp, PL, PR, tet, T3 and T7 as promoters), sequences in which their mutants (e.g. lacUV5) are artificially joined with wild type promoter sequences (e.g. tac, trc) are known as examples of promoter sequences that regulate mRNA transcription, and these can also be used in the present invention.

It is known that the sequence of the ribosome binding site (GAGG and other similar sequences) and its distance to the initiation codon are important in regulating the translation of mRNA into protein. In addition, it is also well known that the terminator, which commands termination of transcription on the 3'-end (e.g. a vector containing rrnBT1T2 is commercially available from Pharmacia), has an effect on protein synthesis efficiency in the recombinant.

Although commercially available products can be used as is for the vector that can be used for preparation of the recombinant vector of the present invention, various types of vectors derived according to a specific purpose can also be used. Examples of these include pBR322, pBR327, pKK223-3, pKK233-2 and pTrc99, originating in pMB1 and having its replicon; pUC18, pUC19, pUC118, pUC119, pBluescript, pHSG298 and pHSG396, modified to improve the number of copies; pACYC177 and pACYC184, derived from p15A and having its replicon; as well as plasmids originating in pSC101, ColE1, R1 and F factor. Moreover, expression vectors for fused proteins that are easier to purify can also be used, examples of which include pGEX-2T, pGEX-3X and pMal-c2. An example of a gene used as the starting material in the present invention is described in Japanese Patent Application No. 6-315572.

In addition, gene introduction can also be performed by using virus vectors and transposons, such as λ-phages and M13 phages, in addition to plasmids. In the case of gene introduction into a microorganism other than E. coli, gene introduction into Bacillus sp. is known using pUB110 (sold by Sigma) or pHY300PLK (sold by Takara Shuzo). These vectors are described in Molecular Cloning (J. Sambrook, E.F. Fritsch, T. Maniatis ed., Cold Spring Harbor Laboratory Press, pub.), Cloning Vectors (P.H. Pouwels, B.E. Enger-Valk, W.J. Brammar ed., Elsevier pub.) and various company catalogs.
Insertion of a DNA fragment coding for GGPP synthase and, as necessary, a DNA fragment having the function of regulating expression of the gene of the above-mentioned enzyme, into these vectors can be performed according to known methods using suitable restriction enzymes and ligase. Specific examples of plasmids of the invention prepared in this manner include pBS-GGPSmut1, pBS-GGPSmut2, pBS-GGPSmut3, pBS-GGPSmut4 and pBS-GGPSmut5.

Examples of microorganisms that can be used for gene introduction with this type of recombinant vector include E. coli and Bacillus sp. This transformation can also be performed according to routine methods such as the CaCl2 method or protoplast method described in Molecular Cloning (J. Sambrook, E.F. Fritsch, T. Maniatis ed., Cold Spring Harbor Laboratory Press pub.) and DNA Cloning Vol. I-III (D.M. Glover ed., IRL Press pub.).

In producing the mutant enzyme of the present invention, the above-mentioned transformed cell is cultured, after which the mutant enzyme can be collected and purified from that culture in accordance with routine methods, examples of which include salting out, organic solvent sedimentation, gel filtration, affinity chromatography, hydrophobic interaction chromatography and ion exchange chromatography.

In addition, the present invention provides a process for producing prenyl diphosphate using the enzyme of the present invention. In this process, the enzyme of the present invention should be allowed to react in a medium, and particularly an aqueous medium, and then the target prenyl diphosphate should be recovered from the reaction medium as desired. The enzyme may be not only purified enzyme, but also crude enzyme obtained by semi-purification through various stages, or a substance containing the enzyme such as cultured microorganisms or the culture itself. In addition, the above-mentioned enzyme, crude enzyme or enzyme-containing substance may be an immobilized enzyme that has been immobilized in accordance with conventional methods.

EP 0 763 542 A2

Prenyl diphosphate having fewer carbon atoms than the target prenyl diphosphate, such as 5-20 carbon atoms and preferably less than 5 carbon atoms, and isopentenyl diphosphate are used for the substrate. Water or an aqueous buffer, such as phosphate buffer, is used for the reaction medium.



EXAMPLES



The following Examples provide a more detailed explanation of the present invention. Furthermore, the materials used in the Examples are available to a person skilled in the art as described below.

Strain C296-LH3 of the budding yeast, Saccharomyces cerevisiae (Tzagoloff, A. and Dieckmann, C.L. (1990) Microbiological Reviews 54, 211-255; Tzagoloff, A. et al. (1975) J. Bacteriol. 122, 826-831), was used as the screening host.

Plasmid YEpG3ASpH, in which portions other than the HexPS coding region had been removed from plasmid pG3A1 (Ashby, M.N. and Edwards, P.A. (1990) J. Biol. Chem. 265, 13157-13164), was used for the positive control plasmid.

Y-PGK, in which the crtE gene portion had been removed from Y-crtE (Misawa, N. et al. (1990) J. Bacteriology 172), was used as the expression vector for library preparation. Saccharomyces cerevisiae A451 was used as a wild strain for the positive control.

However, the experimental materials required for the present invention are not limited to those described above; equivalent substitutes can also be used.

The screening host mutant strain C296-LH3 is a strain deficient for the HexPS gene. A budding yeast HexPS gene fragment can easily be obtained from a widely known wild strain of budding yeast by PCR based on the budding yeast HexPS gene sequence (registered in the GenBank/EMBL Data Bank). If this gene fragment is then combined with a yeast integrating plasmid (YIp) such as pRS405 or pRS406 (available from Stratagene), an HexPS-deficient strain can easily be prepared by the widely conducted method of gene destruction using homologous recombination.

In addition, it is also sufficient for the positive control plasmid if this gene fragment is inserted into a widely known budding yeast expression vector such as pYEUra3 (available from Clontech) or pYES2 (available from Invitrogen). The strain used for the positive control is not limited to strain A451; any strain is sufficient provided it possesses the wild HexPS gene. In addition, it is sufficient to use a commercially available vector for the expression vector for library preparation, such as pYEUra3 available from Clontech or pYES2 available from Invitrogen.

LKC-18 plates for thin layer chromatography were purchased from Whatman Chemical Separation Inc. [1-14C]IPP was purchased from Amersham.

Example 1. Plasmid Construction 

HindIII restriction enzyme sites were introduced both upstream and downstream of the GGPS gene (GenBank accession number D28748) of S. acidocaldarius by PCR using the chemically synthesized DNA primers 5'-AAGAGAAGCTTATGAGTTACTTTGAC-3' (SEQ ID NO: 7) and a second primer (SEQ ID NO: 8). Genomic DNA was prepared in accordance with routine methods from Sulfolobus acidocaldarius, obtainable as ATCC 33909 from the American Type Culture Collection (ATCC), and its cloned DNA was then used for the template DNA of PCR.

The PCR product was cleaved with HindIII and inserted at the HindIII site of pBluescript that had been cleaved with HindIII to form pBS-GGPS. The crtE gene portion was removed by cleaving plasmid Y-crtE with HindIII, and the remaining portion containing the PGK promoter and PGK terminator was self-ligated to form Y-PGK. The insert portion containing the GGPS gene, obtained by severing pBS-GGPS with HindIII, was introduced at the HindIII site of Y-PGK to form Y-GGPS.

Example 2. Random Mutagenesis of GGPS Gene

A random mutation was introduced into the region coding for the GGPS gene using nitrite according to the method of Myers et al. ((1985) Science 229, 242-247). Single strand DNA was isolated from E. coli carrying pBS-GGPS by infection with helper phage M13K07, and this was then treated for 60 minutes with 1 M sodium nitrite. Next, the complementary strand was synthesized using as primer the chemically synthesized DNA 5'-CCCCCCTCGAGGTCGACGGTATCGATAA-3' (SEQ ID NO: 9) corresponding to the sequence of the T7 promoter portion. The GGPS gene portion was then extracted with HindIII restriction enzyme, introduced at the HindIII site of Y-PGK, and transformed to E. coli strain XL1-Blue to prepare the library.

Example 3. Yeast Transformation and Screening

The budding yeast, Saccharomyces cerevisiae, was transformed by the spheroplast method according to the method of Ashby et al. (Ashby, M.N. and Edwards, P.A. (1990) J. Biol. Chem. 265, 13157-13164). Namely, HexPS-deficient strain C296-LH3 was transformed with the previously described plasmid library and cultured on a leucine-deficient agar plate (Leu- plate) using the top agar method (3% bactoagar, 0.67% yeast nitrogen base, 0.05% yeast extract, 0.05% bacto peptone, 1.0 M sorbitol and 2% glucose).

The transformants having the Leu+ phenotype were inoculated onto YEPG (1% yeast extract, 2% ethanol, 2% bacto peptone and 3% glycerol), D (1% yeast extract, 2% ethanol, 2% bacto peptone, 3% glycerol and 0.1% glucose) and YPD (1% yeast extract, 2% bacto peptone and 2% glucose) agar media followed by incubation for 3 days at 30°C. From the C296-LH3 transformants carrying plasmids containing mutated GGPS, clones were selected that grew on the YEPG agar plate and formed colonies larger than those of non-transformed C296-LH3 on the D plate.

This complemented phenotype is considered to indicate that the electron transport chain is active during the respiration reaction, or in other words, that an active coenzyme Q was synthesized in the C296-LH3 cells. Five clones having this complemented phenotype were obtained from 1,400 transformants. As a result of retesting the resulting five clones, not only were they able to grow on YEPG agar plates, but they also possessed the ability to form colonies that were clearly larger than those of YEpG3ASpH/C296-LH3, having a plasmid that contains the HexPS gene of yeast origin, on D agar plates. The plasmid DNA of these five clones was purified in accordance with routine methods.

These plasmids were named Y-GGPSmut1, Y-GGPSmut2, Y-GGPSmut3, Y-GGPSmut4 and Y-GGPSmut5.

Furthermore, since yeast strain C296-LH3 is deficient in HexPS activity, it is unable to biosynthesize coenzyme Q6, which has a hexaprenyl group on its side chain. Since coenzyme Q6 is required for non-fermentative metabolism, C296-LH3 forms colonies on media containing a small amount of glucose that are smaller than those of the wild strain, and does not grow on media that contain only a non-fermentative substrate like glycerol for the carbon source. Prior to screening for mutated activity, the effects of expression of wild type GGPS derived from Sulfolobus acidocaldarius were investigated.

On the D plates, strain Y-GGPS/C296-LH3, which is strain C296-LH3 having a plasmid containing the wild type GGPS gene, was found to form colonies clearly smaller than those of YEpG3ASpH/C296-LH3, although intermediate in size between YEpG3ASpH/C296-LH3, which possesses a plasmid containing the HexPS gene of yeast origin, and C296-LH3, which does not possess a plasmid. However, Y-GGPS/C296-LH3 was unable to grow on the YEPG plate. This screening method was therefore confirmed to be useful.
Example 4. Determination of DNA Nucleotide Sequence and its Analysis

The nucleotide sequences of DNA coding for the five mutant GGPS contained in the five types of purified plasmids were determined using the Perkin-Elmer Model 373A Fluorescent DNA Sequencer according to the dideoxy chain termination method. Analysis of the nucleotide sequences was performed using the genetic data analysis software, MacMollyTetra.

The amino acid substitution sites as deduced from the nucleotide sequence of each mutant GGPS are shown in Fig. 1. Replacement sites were found at the nucleotide sequence level for all selected mutants. In the case of Mutant 1, which is the Y-GGPSmut1 insertion fragment, methionine at position 85 was changed to isoleucine, arginine at position 199 was changed to lysine, and aspartic acid at position 312 was changed to asparagine. In the case of Mutant 2, which is the Y-GGPSmut2 insertion fragment, the only replacement was phenylalanine at position 118 changed to leucine. In the case of Mutant 3, which is the Y-GGPSmut3 insertion fragment, phenylalanine at position 77 was changed to serine. In the case of Mutant 4, which is the Y-GGPSmut4 insertion fragment, phenylalanine at position 77 was changed to leucine and valine at position 99 was changed to methionine. In the case of Mutant 5, which is the Y-GGPSmut5 insertion fragment, phenylalanine at position 77 was changed to serine and tyrosine at position 101 was changed to histidine.

A high proportion of these mutations consist of an aromatic amino acid residue being replaced with a non-aromatic amino acid residue. Phenylalanine at position 77 in particular has the most significant effect on the chain elongation reaction. Phenylalanine at position 77 is located five residues upstream from the N-terminal residue of aspartate-rich domain I. There are two aspartate-rich domain motifs (DDXX(XX)D) that are conserved in prenyl transferases. The diphosphate portion of the substrates is believed to bind here. The amino acid residue located at the fifth position upstream from the N-terminal residue of this aspartate-rich domain, which was focused on for the first time in the present invention, is considered to be extremely important in determining the chain length of the reaction product.
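The positional relationship described above, namely a DDXX(XX)D motif and the residue five positions upstream of its first aspartate, can be located with a short pattern search. The following is a minimal sketch under stated assumptions: the function names are hypothetical, and the test sequence is a constructed placeholder (not the GGPS sequence) in which a DDXXD motif is placed at position 82 so that position 77 falls five residues upstream.

```python
import re

# DDXX(XX)D: two aspartates, two residues, an optional further two residues, one aspartate.
# A lookahead is used so that overlapping occurrences are also reported.
MOTIF = re.compile(r"(?=(DD..(?:..)?D))")


def find_motifs(sequence: str) -> list:
    """Return (1-based start position, matched text) for every DDXX(XX)D occurrence."""
    return [(m.start() + 1, m.group(1)) for m in MOTIF.finditer(sequence)]


def residue_upstream(sequence: str, motif_start: int, offset: int = 5) -> tuple:
    """Return (1-based position, residue) lying `offset` residues before the motif start."""
    position = motif_start - offset
    if position < 1:
        raise ValueError("motif is too close to the N-terminus")
    return position, sequence[position - 1]


# Placeholder sequence: 76 fillers, F at 77, four fillers, DDGGD at 82-86, more fillers.
PLACEHOLDER = "G" * 76 + "F" + "G" * 4 + "DDGGD" + "G" * 20
```

Applied to a real prenyl transferase sequence, the first motif found would correspond to aspartate-rich domain I, and `residue_upstream` would report the residue whose replacement is discussed here.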
A crude extract was prepared from the five selected clones (Y-GGPSmut1/C296-LH3, Y-GGPSmut2/C296-LH3, Y-GGPSmut3/C296-LH3, Y-GGPSmut4/C296-LH3 and Y-GGPSmut5/C296-LH3) according to the method of Itoh et al. (Itoh, N. et al. (1984) J. Biol. Chem. 259, 13923-13929).

Namely, the above-mentioned yeast was incubated for 4 days at 30°C. Approximately 400 µg of cells were collected by centrifugation and washed once with 800 µl of buffer A (50 mM Tris-HCl pH 7.5, 5 mM MgCl2, 50 mM dithiothreitol, 1 M sorbitol). The cells were then suspended in 1.2 ml of buffer B (50 mM Tris-HCl pH 7.5, 5 mM MgCl2, 3 mM dithiothreitol, 1 M sorbitol) followed by the addition of 0.8 mg of zymolyase and incubation for 1 hour at 30°C.

The prepared spheroplasts were washed three times with buffer B and suspended in 1 ml of buffer C (50 mM Tris-HCl pH 7.0, 10 mM 2-mercaptoethanol, 1 mM phenylmethanesulfonyl fluoride, 1 mM EDTA). Ultrasonic treatment was performed 10 times on the suspension in ice at two minute intervals, performing treatment for 10 seconds at a time at maximum output using a Branson Sonifier. The lysates were incubated for 1 hour at 55°C to inactivate the prenyl transferase(s) of the host cells, and were then centrifuged for 10 minutes at 10,000 x g. The resulting supernatant was used as a mutant GGPS crude enzyme solution for the assay of prenyl transferase activity.

The results of performing an assay of prenyl transferase activity by LKC-18 thin layer chromatography using this mutant GGPS crude enzyme solution prepared from yeast are shown in Figs. 2 and 3.

After carrying out the enzyme reaction at 55°C, polyprenyl diphosphate was extracted with 1-butanol, after which the 1-butanol was evaporated with a nitrogen gas flow. The resulting polyprenyl diphosphate was treated with acid phosphatase in accordance with the method of Fujii et al. (Fujii et al. (1982) Biochim. Biophys. Acta 712, 716-718). The hydrolysis product was extracted with pentane, and after performing thin layer chromatography using acetone/H2O (9:1) for the developing solution, the distribution of radioactivity was analyzed with the Fuji Film Model BAS2000 Bio-image Analyzer. The alcohols used as the authentic standards (geraniol, farnesol, geranylgeraniol), on which thin layer chromatography was performed simultaneously, followed by staining with iodine vapor, were used to determine the developing locations.

Fig. 2 shows the result of reacting using [1-14C]IPP and (all-E)-FPP for the substrates, while Fig. 3 shows the result of reacting using [1-14C]IPP and (all-E)-GGPP for the substrates. Spots a through c correspond to the authentic standard samples, a indicating geraniol, b indicating (all-E)-farnesol, and c indicating (all-E)-geranylgeraniol. Ori indicates the sample-spotting point. S.F. indicates the solvent front.

On the basis of these results, in the case of using GGPP for the allylic substrate, it was shown that each mutant GGPS is able to synthesize geranylfarnesyl diphosphate (GFPP), which is one isoprene unit longer than the reaction product of the wild type enzyme. On the other hand, the wild type GGPS is unable to synthesize a reaction product having a chain length the same as or longer than that of GFPP at a level that allows detection. In the case of using FPP for the allylic substrate, the GGPP/GFPP product ratio differed among the mutant GGPS.

Example 6. Preparation of Mutant GGPS from E. coli

In order to ensure that the analysis is performed more accurately, each mutant GGPS was overexpressed in E. coli strain XL1-Blue. Namely, each of the five plasmids Y-GGPSmut1, Y-GGPSmut2, Y-GGPSmut3, Y-GGPSmut4 and Y-GGPSmut5 obtained in screening was digested with HindIII to obtain HindIII DNA fragments that code for the mutant GGPS. These HindIII DNA fragments were inserted at the HindIII site of the plasmid vector pBluescript (KS(+)) to obtain pBS-GGPSmut1, pBS-GGPSmut2, pBS-GGPSmut3, pBS-GGPSmut4 and pBS-GGPSmut5 respectively.

E. coli XL1-Blue was transformed with pBS-GGPSmut1, pBS-GGPSmut2, pBS-GGPSmut3, pBS-GGPSmut4 and pBS-GGPSmut5 and cultured according to the method described in Molecular Cloning (Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York). After collecting the bacterial cells, the bacterial cells were ultrasonically homogenized in 50 mM Tris-HCl buffer containing 10 mM 2-mercaptoethanol and 1 mM EDTA. After heat treating the homogenate for 1 hour at 55°C, it was centrifuged for 10 minutes at 100,000 x g. The supernatant was then collected as the crude enzyme solution, which was assayed for prenyl transferase (PTase) activity.

The assay was performed by analysis of the products with LKC-18 thin layer chromatography and by determination of enzyme activity. For thin layer chromatography, DMAPP, GPP, (all-E)-FPP and (all-E)-GGPP were used for the allylic substrates, and after reacting in the same manner as Example 5, LKC-18 thin layer chromatography was performed in the same manner as Example 5. Those results are shown in Figs. 4 and 5.

Fig. 4(A) shows the result of reacting [1-14C]IPP with DMAPP as the substrate, and (B) shows the result of reacting
[1-14C]IPP with GPP as the substrate. Fig. 5(C) shows the result of reacting [1-14C]IPP with (all-E)-FPP as the
substrate, while (D) shows the result of reacting [1-14C]IPP with (all-E)-GGPP as the substrate. Ellipses a through c
show the positions of the authentic standard samples: a indicates geraniol, b indicates (all-E)-farnesol and c
indicates (all-E)-geranylgeraniol. Ori indicates the sample spotting point, while S.F. indicates the solvent front.

The prenyl transferase activity was assayed as follows. One ml of assay mixture, containing 25 nmol of [1-14C]IPP (37
GBq/mol), 25 nmol of allylic substrate (DMAPP, GPP, (all-E)-FPP or (all-E)-GGPP), 5 µmol of MgCl2, 10 µmol of
phosphate buffer (pH 5.8) and the enzyme solution, was incubated for 60 minutes at 55°C.

The reaction was stopped by cooling rapidly on ice. After adding 3.5 ml of water-saturated 1-butanol to the chilled
mixture and shaking vigorously, the 1-butanol layer was washed with NaCl-saturated water and its 14C radioactivity was
measured with a liquid scintillation counter. One unit of enzyme activity was defined as the amount of enzyme that
incorporates 1 nmol of [1-14C]IPP into elongated prenyl diphosphate (polyprenyl diphosphate) that can be extracted into
the 1-butanol layer. The results are shown in the Table.
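
As a rough illustration of how a scintillation reading relates to the amount of IPP incorporated, the following sketch converts dpm into nmol using the stated specific activity of 37 GBq/mol. The function names, the example reading of 4,440 dpm and the assumption of ideal (100%) counting and extraction efficiency are illustrative and are not taken from the experiments described here.

```python
# Hypothetical helper: convert a scintillation count (dpm) into nmol of
# [1-14C]IPP. Assumes the 37 GBq/mol specific activity quoted in the assay
# description and ideal (100%) counting and extraction efficiency.

SPECIFIC_ACTIVITY_BQ_PER_MOL = 37e9  # 37 GBq/mol
SECONDS_PER_MINUTE = 60
NMOL_PER_MOL = 1e9


def dpm_per_nmol(specific_activity_bq_per_mol=SPECIFIC_ACTIVITY_BQ_PER_MOL):
    """Disintegrations per minute produced by 1 nmol of labelled IPP."""
    bq_per_nmol = specific_activity_bq_per_mol / NMOL_PER_MOL  # 1 Bq = 1 dps
    return bq_per_nmol * SECONDS_PER_MINUTE


def nmol_ipp_incorporated(dpm):
    """Amount of IPP (nmol) corresponding to a butanol-extractable count."""
    return dpm / dpm_per_nmol()


if __name__ == "__main__":
    print(round(dpm_per_nmol()))                  # 2220 dpm per nmol
    print(round(nmol_ipp_incorporated(4440), 2))  # 2.0 nmol
```

Under these assumptions, 1 nmol of labelled IPP corresponds to about 2,220 dpm, so a butanol-layer count can be read as an upper bound on the nmol of IPP incorporated.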



Table

Enzyme     Substrate  Relative Activity   Product Distribution (%)
                      (dpm)               GPP     FPP     GGPP    GFPP    HexPP
--------------------------------------------------------------------------------
Mutant 1   DMAPP      31,800              23.2    8.77    29.6    38.0    0.45
           GPP         5,260              nd*     38.8    30.9    30.4    0.02
           FPP         4,340              nd*     nd*     65.1    35.0    nd*
           GGPP          998              nd*     nd*     nd*     100     nd*
Mutant 2   DMAPP      15,800              1.44    0.66    89.0    8.86    nd*
           GPP         7,050              nd*     20.3    74.9    4.89    nd*
           FPP         6,080              nd*     nd*     89.5    10.5    nd*
           GGPP          379              nd*     nd*     nd*     100     nd*
Mutant 3   DMAPP      24,900              3.40    27.4    16.6    51.6    0.92
           GPP         9,890              nd*     64.7    9.37    24.5    1.44
           FPP         7,280              nd*     nd*     30.4    69.6    nd*
           GGPP        3,200              nd*     nd*     nd*     100     nd*
Mutant 4   DMAPP      16,700              4.93    4.07    73.2    17.8    nd*
           GPP         7,460              nd*     38.4    51.3    10.3    nd*
           FPP         5,650              nd*     nd*     85.9    14.1    nd*
           GGPP          551              nd*     nd*     nd*     100     nd*
Mutant 5   DMAPP      23,600              27.1    18.6    12.8    40.4    1.12
           GPP         9,070              nd*     59.3    13.0    26.1    1.56
           FPP         8,960              nd*     nd*     32.0    68.0    nd*
           GGPP        2,200              nd*     nd*     nd*     100     nd*
Wild type  DMAPP      13,600              5.61    0.43    94.0    nd*     nd*
           GPP         6,640              nd*     17.2    82.8    nd*     nd*
           FPP         4,650              nd*     nd*     100     nd*     nd*
           GGPP       nd*                 nd*     nd*     nd*     nd*     nd*

* nd: Not detected

Each mutant GGPS exhibited activity that synthesizes polyprenyl diphosphate having a longer chain length than GGPP. The
wild type GGPS as well as each mutant GGPS reacted best with DMAPP among the four allylic substrates. In addition, the
relative activities observed when allylic substrates having a shorter chain length than FPP were used exhibited similar
values. However, the relative activity and product distribution when GGPP was used as the allylic substrate were
considerably different.

When DMAPP, GPP and FPP were used as the allylic substrates, Mutant 1, which is coded for by the insert DNA of plasmid
pBS-GGPSmut1, yielded GFPP and GGPP as the major reaction products. In particular, when DMAPP was used as the allylic
substrate, only a slight amount of hexaprenyl diphosphate (HexPP) was detected as a reaction product. Although the
distribution of reaction products varied with the allylic substrate, the proportion of product produced in one cycle of
the condensation reaction was large.

In the case of Mutant 2, which is coded for by the insert DNA of plasmid pBS-GGPSmut2, the major product was GGPP and
the proportion of GFPP was roughly 10%. HexPP was not detected.

Mutant 3, which is coded for by the insert DNA of plasmid pBS-GGPSmut3, and Mutant 5, which is coded for by the insert
DNA of plasmid pBS-GGPSmut5, demonstrated similar characteristics. These mutants exhibited strong GFPP synthesis
activity, while also synthesizing a small amount of HexPP.

Mutant 4, which is coded for by the insert DNA of plasmid pBS-GGPSmut4, yielded GGPP as the major product, while the
proportion of GFPP was roughly 15%. FPP was effectively synthesized when GPP was used as the allylic substrate.
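
The comparison among the enzymes can also be summarized numerically. The following sketch sums the GFPP and HexPP percentages reported in the Table for the DMAPP substrate and ranks the enzymes by the share of product longer than GGPP. Treating "nd" as zero, reading the last column of the Table as HexPP, and the dictionary layout and function names are all assumptions made for illustration; they are not part of the original analysis.

```python
# Percentages of GFPP (C25) and HexPP (C30) obtained with DMAPP as the allylic
# substrate, copied from the Table. "nd" (not detected) is stored as None and
# counted as zero, which is an assumption of this sketch.
DMAPP_PRODUCTS = {
    "Mutant 1": {"GFPP": 38.0, "HexPP": 0.45},
    "Mutant 2": {"GFPP": 8.86, "HexPP": None},
    "Mutant 3": {"GFPP": 51.6, "HexPP": 0.92},
    "Mutant 4": {"GFPP": 17.8, "HexPP": None},
    "Mutant 5": {"GFPP": 40.4, "HexPP": 1.12},
    "Wild type": {"GFPP": None, "HexPP": None},
}


def longer_than_ggpp(distribution):
    """Percentage of product longer than GGPP (C20), counting nd as 0."""
    return sum(value for value in distribution.values() if value is not None)


def rank_by_long_chain(products):
    """Enzyme names ordered from highest to lowest long-chain share."""
    return sorted(products, key=lambda name: longer_than_ggpp(products[name]),
                  reverse=True)


if __name__ == "__main__":
    for name in rank_by_long_chain(DMAPP_PRODUCTS):
        share = longer_than_ggpp(DMAPP_PRODUCTS[name])
        print(f"{name}: {share:.2f}% longer than GGPP")
```

With these figures, Mutants 3 and 5 rank highest and the wild type enzyme lowest, which agrees with the description of Mutants 3 and 5 as having strong GFPP synthesis activity.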
SEQUENCE LISTING

SEQ ID NO: 1
Sequence Length: 993
Sequence Type: Nucleic acid
Strandness: Double strand
Topology: Linear
Molecular Type: Genomic DNA
Source
Organism: Sulfolobus acidocaldarius
Sequence

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC   48
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val Asn
                5                   10                  15

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA   96
Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu Tyr Glu
            20                  25                  30

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA  144
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu
        35                  40                  45

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT  192
Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln Arg Glu Arg Ala
    50                  55                  60

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TTT ACG CTT GTG  240
Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His Thr Phe Thr Leu Val
65                  70                  75                  80

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA  288
His Asp Asp Ile Met Asp Gln Asp Asn Ile Arg Arg Gly Leu Pro Thr
                85                  90                  95

GTC CAC GTG AAA TAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA  336
Val His Val Lys Tyr Gly Leu Pro Leu Ala Ile Leu Ala Gly Asp Leu
            100                 105                 110

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG  384
Leu His Ala Lys Ala Phe Gln Leu Leu Thr Gln Ala Leu Arg Gly Leu
        115                 120                 125

CCA AGT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA  432
Pro Ser Glu Thr Ile Ile Lys Ala Phe Asp Ile Phe Thr Arg Ser Ile
    130                 135                 140

ATA ATT ATA TCC GAA GGA CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA  480
Ile Ile Ile Ser Glu Gly Gln Ala Val Asp Met Glu Phe Glu Asp Arg
145                 150                 155                 160

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA  528
Ile Asp Ile Lys Glu Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr
                165                 170                 175

GCT GCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT  576
Ala Ala Leu Phe Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly
            180                 185                 190

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA  624
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu
        195                 200                 205

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC  672
Gly Ile Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp
    210                 215                 220

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
225                 230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC GAG  768
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu
                245                 250                 255

AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA  816
Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu
            260                 265                 270

GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT  864
Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser Leu Asp Tyr
        275                 280                 285

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA  912
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala Ile Asp Ser Leu
    290                 295                 300

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT  960
Asn Gln Val Ser Ser Lys Ser Asp Ile Pro Gly Lys Ala Leu Lys Tyr
305                 310                 315                 320

CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA  993
Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys TER
                325                 330

SEQ ID NO: 2
Sequence Length: 993
Sequence Type: Nucleic acid
Strandness: Double strand
Topology: Linear
Molecular Type: Mutated genomic DNA
Sequence

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC   48
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val Asn
                5                   10                  15

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA   96
Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu Tyr Glu
            20                  25                  30

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA  144
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu
        35                  40                  45

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT  192
Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln Arg Glu Arg Ala
    50                  55                  60

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TTT ACG CTT GTG  240
Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His Thr Phe Thr Leu Val
65                  70                  75                  80

CAT GAT GAT ATT ATA GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA  288
His Asp Asp Ile Ile Asp Gln Asp Asn Ile Arg Arg Gly Leu Pro Thr
                85                  90                  95

GTC CAC GTG AAA TAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA  336
Val His Val Lys Tyr Gly Leu Pro Leu Ala Ile Leu Ala Gly Asp Leu
            100                 105                 110

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG  384
Leu His Ala Lys Ala Phe Gln Leu Leu Thr Gln Ala Leu Arg Gly Leu
        115                 120                 125

CCA AGT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA  432
Pro Ser Glu Thr Ile Ile Lys Ala Phe Asp Ile Phe Thr Arg Ser Ile
    130                 135                 140

ATA ATT ATA TCC GAA GGA CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA  480
Ile Ile Ile Ser Glu Gly Gln Ala Val Asp Met Glu Phe Glu Asp Arg
145                 150                 155                 160

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA  528
Ile Asp Ile Lys Glu Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr
                165                 170                 175

GCT GCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT  576
Ala Ala Leu Phe Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly
            180                 185                 190

GCT AAT GAT AAT GAT GTA AAA CTG ATG TCT GAT TTC GGT ACG AAT CTA  624
Ala Asn Asp Asn Asp Val Lys Leu Met Ser Asp Phe Gly Thr Asn Leu
        195                 200                 205

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC  672
Gly Ile Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp
    210                 215                 220

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
225                 230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC GAG  768
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu
                245                 250                 255

AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA  816
Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu
            260                 265                 270

GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT  864
Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser Leu Asp Tyr
        275                 280                 285

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA  912
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala Ile Asp Ser Leu
    290                 295                 300

AAT CAA GTC TCC TCT AAG AGT AAT ATA CCT GGA AAG GCT TTA AAA TAT  960
Asn Gln Val Ser Ser Lys Ser Asn Ile Pro Gly Lys Ala Leu Lys Tyr
305                 310                 315                 320

CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA  993
Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys TER
                325                 330

SEQ ID NO: 3
Sequence Length: 993
Sequence Type: Nucleic acid
Strandness: Double strand
Topology: Linear
Molecular Type: Mutated genomic DNA
Sequence

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC   48
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val Asn
                5                   10                  15

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA   96
Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu Tyr Glu
            20                  25                  30

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA  144
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu
        35                  40                  45

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT  192
Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln Arg Glu Arg Ala
    50                  55                  60

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TTT ACG CTT GTG  240
Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His Thr Phe Thr Leu Val
65                  70                  75                  80

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA  288
His Asp Asp Ile Met Asp Gln Asp Asn Ile Arg Arg Gly Leu Pro Thr
                85                  90                  95

GTC CAC GTG AAA TAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA  336
Val His Val Lys Tyr Gly Leu Pro Leu Ala Ile Leu Ala Gly Asp Leu
            100                 105                 110

CTA CAT GCA AAG GCT CTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG  384
Leu His Ala Lys Ala Leu Gln Leu Leu Thr Gln Ala Leu Arg Gly Leu
        115                 120                 125

CCA AGT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA  432
Pro Ser Glu Thr Ile Ile Lys Ala Phe Asp Ile Phe Thr Arg Ser Ile
    130                 135                 140

ATA ATT ATA TCC GAA GGA CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA  480
Ile Ile Ile Ser Glu Gly Gln Ala Val Asp Met Glu Phe Glu Asp Arg
145                 150                 155                 160

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA  528
Ile Asp Ile Lys Glu Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr
                165                 170                 175

GCT GCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT  576
Ala Ala Leu Phe Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly
            180                 185                 190

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA  624
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu
        195                 200                 205

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC  672
Gly Ile Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp
    210                 215                 220

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
225                 230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC GAG  768
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu
                245                 250                 255

AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA  816
Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu
            260                 265                 270

GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT  864
Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser Leu Asp Tyr
        275                 280                 285

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA  912
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala Ile Asp Ser Leu
    290                 295                 300

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT  960
Asn Gln Val Ser Ser Lys Ser Asp Ile Pro Gly Lys Ala Leu Lys Tyr
305                 310                 315                 320

CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA  993
Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys TER
                325                 330

SEQ ID NO: 4
Sequence Length: 993
Sequence Type: Nucleic acid
Strandness: Double strand
Topology: Linear
Molecular Type: Mutated genomic DNA
Sequence

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC   48
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val Asn
                5                   10                  15

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA   96
Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu Tyr Glu
            20                  25                  30

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA  144
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu
        35                  40                  45

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT  192
Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln Arg Glu Arg Ala
    50                  55                  60

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TCT ACG CTT GTG  240
Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His Thr Ser Thr Leu Val
65                  70                  75                  80

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA  288
His Asp Asp Ile Met Asp Gln Asp Asn Ile Arg Arg Gly Leu Pro Thr
                85                  90                  95

GTC CAC GTG AAA TAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA  336
Val His Val Lys Tyr Gly Leu Pro Leu Ala Ile Leu Ala Gly Asp Leu
            100                 105                 110

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG  384
Leu His Ala Lys Ala Phe Gln Leu Leu Thr Gln Ala Leu Arg Gly Leu
        115                 120                 125

CCA AGT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA  432
Pro Ser Glu Thr Ile Ile Lys Ala Phe Asp Ile Phe Thr Arg Ser Ile
    130                 135                 140

ATA ATT ATA TCC GAA GGA CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA  480
Ile Ile Ile Ser Glu Gly Gln Ala Val Asp Met Glu Phe Glu Asp Arg
145                 150                 155                 160

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA  528
Ile Asp Ile Lys Glu Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr
                165                 170                 175

GCT GCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT  576
Ala Ala Leu Phe Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly
            180                 185                 190

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA  624
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu
        195                 200                 205

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC  672
Gly Ile Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp
    210                 215                 220

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
225                 230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC GAG  768
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu
                245                 250                 255

AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA  816
Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu
            260                 265                 270

GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT  864
Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser Leu Asp Tyr
        275                 280                 285

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA  912
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala Ile Asp Ser Leu
    290                 295                 300

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT  960
Asn Gln Val Ser Ser Lys Ser Asp Ile Pro Gly Lys Ala Leu Lys Tyr
305                 310                 315                 320

CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA  993
Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys TER
                325                 330

SEQ ID NO: 5
Sequence Length: 993
Sequence Type: Nucleic acid
Strandness: Double strand
Topology: Linear
Molecular Type: Mutated genomic DNA
Sequence

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC   48
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val Asn
                5                   10                  15

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA   96
Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu Tyr Glu
            20                  25                  30

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA  144
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu
        35                  40                  45

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT  192
Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln Arg Glu Arg Ala
    50                  55                  60

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT CTT ACG CTT GTG  240
Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His Thr Leu Thr Leu Val
65                  70                  75                  80

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA  288
His Asp Asp Ile Met Asp Gln Asp Asn Ile Arg Arg Gly Leu Pro Thr
                85                  90                  95

GTC CAC ATG AAA TAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA  336
Val His Met Lys Tyr Gly Leu Pro Leu Ala Ile Leu Ala Gly Asp Leu
            100                 105                 110

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG  384
Leu His Ala Lys Ala Phe Gln Leu Leu Thr Gln Ala Leu Arg Gly Leu
        115                 120                 125

CCA AGT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA  432
Pro Ser Glu Thr Ile Ile Lys Ala Phe Asp Ile Phe Thr Arg Ser Ile
    130                 135                 140

ATA ATT ATA TCC GAA GGA CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA  480
Ile Ile Ile Ser Glu Gly Gln Ala Val Asp Met Glu Phe Glu Asp Arg
145                 150                 155                 160

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA  528
Ile Asp Ile Lys Glu Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr
                165                 170                 175

GCT GCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT  576
Ala Ala Leu Phe Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly
            180                 185                 190

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA  624
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu
        195                 200                 205

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC  672
Gly Ile Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp
    210                 215                 220

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
225                 230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC GAG  768
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu
                245                 250                 255

AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA  816
Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu
            260                 265                 270

GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT  864
Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser Leu Asp Tyr
        275                 280                 285

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA  912
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala Ile Asp Ser Leu
    290                 295                 300

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT  960
Asn Gln Val Ser Ser Lys Ser Asp Ile Pro Gly Lys Ala Leu Lys Tyr
305                 310                 315                 320

CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA  993
Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys TER
                325                 330

SEQ ID NO: 6
Sequence Length: 993
Sequence Type: Nucleic acid
Strandness: Double strand
Topology: Linear
Molecular Type: Mutated genomic DNA
Sequence

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC   48
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu Ile Val Asn Ser Val Asn
                5                   10                  15

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA   96
Asp Ile Ile Lys Ser Tyr Ile Ser Gly Asp Val Pro Lys Leu Tyr Glu
            20                  25                  30

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA  144
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu
        35                  40                  45

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT  192
Ile Leu Thr Ile Ser Ser Asp Leu Phe Gly Gly Gln Arg Glu Arg Ala
    50                  55                  60

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TCT ACG CTT GTG  240
Tyr Tyr Ala Gly Ala Ala Ile Glu Val Leu His Thr Ser Thr Leu Val
65                  70                  75                  80

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA  288
His Asp Asp Ile Met Asp Gln Asp Asn Ile Arg Arg Gly Leu Pro Thr
                85                  90                  95

GTC CAC GTG AAA CAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA  336
Val His Val Lys His Gly Leu Pro Leu Ala Ile Leu Ala Gly Asp Leu
            100                 105                 110

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG  384
Leu His Ala Lys Ala Phe Gln Leu Leu Thr Gln Ala Leu Arg Gly Leu
        115                 120                 125

CCA AGT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA  432
Pro Ser Glu Thr Ile Ile Lys Ala Phe Asp Ile Phe Thr Arg Ser Ile
    130                 135                 140

ATA ATT ATA TCC GAA GGA CAG GCA GTA GAT ATG GAA TTT GAG GAC AGA  480
Ile Ile Ile Ser Glu Gly Gln Ala Val Asp Met Glu Phe Glu Asp Arg
145                 150                 155                 160

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA  528
Ile Asp Ile Lys Glu Gln Glu Tyr Leu Asp Met Ile Ser Arg Lys Thr
                165                 170                 175

GCT GCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT GCT GGT  576
Ala Ala Leu Phe Ser Ala Ser Ser Ser Ile Gly Ala Leu Ile Ala Gly
            180                 185                 190

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA  624
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu
        195                 200                 205

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC  672
Gly Ile Ala Phe Gln Ile Val Asp Asp Ile Leu Gly Leu Thr Ala Asp
    210                 215                 220

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG GAG GGT AAA  720
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp Ile Arg Glu Gly Lys
225                 230                 235                 240

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC GAG  768
Lys Thr Ile Leu Val Ile Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu
                245                 250                 255

AAG AAG ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA  816
Lys Lys Ile Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu
            260                 265                 270

GAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT  864
Glu Leu Met Ser Ser Ala Asp Ile Ile Lys Lys Tyr Ser Leu Asp Tyr
        275                 280                 285

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA  912
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala Ile Asp Ser Leu
    290                 295                 300

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT  960
Asn Gln Val Ser Ser Lys Ser Asp Ile Pro Gly Lys Ala Leu Lys Tyr
305                 310                 315                 320

CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA  993
Leu Ala Glu Phe Thr Ile Arg Arg Arg Lys TER
                325                 330



SEQ ID NO: 7
Sequence Length: 26
Sequence Type: Nucleic acid
Strandness: Single strand
Topology: Linear
Molecular Type: Synthetic DNA
Sequence

AAGAGAAGCT TATGAGTTAC TTTGAC   26

SEQ ID NO: 8
Sequence Length: 21
Sequence Type: Nucleic acid
Strandness: Single strand
Topology: Linear
Molecular Type: Synthetic DNA
Sequence

GATACAAGCT TTATTTTCTC C   21

SEQ ID NO: 9
Sequence Length: 28
Sequence Type: Nucleic acid
Strandness: Single strand
Topology: Linear
Molecular Type: Synthetic DNA
Sequence

CCCCCCTCGA GGTCGACGGT ATCGATAA   28
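
The synthetic DNAs of SEQ ID NO: 7 and 8 each appear to contain the HindIII recognition sequence AAGCTT, which would be consistent with the HindIII fragments handled in Example 6. The sketch below locates that site in the three synthetic sequences. The function name and the 0-based position convention are illustrative assumptions, and the sketch says nothing about how the primers were actually designed.

```python
# Synthetic DNA sequences copied from SEQ ID NO: 7 to 9 (spaces removed).
PRIMERS = {
    "SEQ ID NO: 7": "AAGAGAAGCTTATGAGTTACTTTGAC",
    "SEQ ID NO: 8": "GATACAAGCTTTATTTTCTCC",
    "SEQ ID NO: 9": "CCCCCCTCGAGGTCGACGGTATCGATAA",
}

HINDIII_SITE = "AAGCTT"  # recognition sequence of HindIII


def find_sites(sequence, site):
    """Return the 0-based start positions of every occurrence of `site`."""
    sequence = sequence.upper()
    hits = []
    start = sequence.find(site)
    while start != -1:
        hits.append(start)
        start = sequence.find(site, start + 1)  # allow overlapping matches
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), find_sites(seq, HINDIII_SITE))
```

The printed lengths (26, 21 and 28) can be checked against the Sequence Length fields of the listing.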



The present invention discloses a mutated enzyme comprising a geranylgeranyl diphosphate synthase having its origin in
wild type Sulfolobus acidocaldarius, wherein at least one of phenylalanine at position 77, methionine at position 85,
valine at position 99, tyrosine at position 101, phenylalanine at position 118, arginine at position 199 and aspartic
acid at position 312 is substituted with another amino acid.

Claims 



1. A mutated enzyme wherein at least one of phenylalanine at position 77, methionine at position 85, valine at position
99, tyrosine at position 101, phenylalanine at position 118, arginine at position 199 and aspartic acid at position 312
in a geranylgeranyl diphosphate synthase of Sulfolobus acidocaldarius origin is replaced with another amino acid,
which enzyme is able to form prenyl diphosphate having at least 25 carbon atoms, or a modified mutant enzyme
that is modified by replacing, deleting and/or adding one to several amino acids, which enzyme maintains the
activity of the above-mentioned enzyme.

2. An enzyme as set forth in claim 1 wherein at least phenylalanine at position 77 is substituted with another amino
acid.

3. An enzyme as set forth in claim 2 wherein said amino acid is a non-aromatic amino acid. 

4. An enzyme as set forth in claim 2 wherein phenylalanine at position 77 is substituted with a non-aromatic amino 
acid. 

5. An enzyme as set forth in either claim 2 or claim 3 wherein valine at position 99 is further substituted by another 
amino acid. 

6. An enzyme as set forth in either claim 2 or claim 3 wherein tyrosine at position 101 is further substituted by another
amino acid.

7. An enzyme as set forth in claim 1 wherein at least methionine at position 85, arginine at position 199, and aspartic
acid at position 312 are substituted with other amino acids.

8. An enzyme as set forth in claim 1 wherein at least phenylalanine at position 118 is substituted with another amino
acid.

9. A gene that codes for an enzyme as set forth in any of claims 1 through 8. 

10. An expression vector that contains a gene as set forth in claim 9. 

11. A host transfected by an expression vector as set forth in claim 10.

12. A process for production of an enzyme as set forth in any of claims 1 through 8, comprising the steps of:

culturing host cells transformed with an expression vector comprising a gene coding for the enzyme of claim 1,
and

recovering the enzyme.

13. A process for production of a mutated prenyl diphosphate synthase comprising the step of:

culturing host cells transformed with a gene mutated by substitution of a codon for the amino acid residue at
the fifth position upstream of the amino terminal of the aspartic acid-rich domain I with a codon for a non-aromatic
amino acid residue, so as to express the mutated prenyl diphosphate synthase, which can produce prenyl diphosphate
having a longer chain than that produced by the original wild-type prenyl diphosphate synthase.

14. A process for production of a prenyl diphosphate having 25 or more carbon atoms, comprising reacting an enzyme
according to any one of claims 1 to 8, or an enzyme produced by a process according to claim 12 or 13, with a
substrate selected from isopentenyl diphosphate, dimethylallyl diphosphate, geranyl diphosphate, farnesyl
diphosphate and geranylgeranyl diphosphate.